A Comprehensive Assessment of Genetic and Epigenetic Alterations Identifies Frequent Variations Impacting Six Prototypic SCF Complex Members

The SKP1, CUL1, F-box protein (SCF) complex represents a family of 69 E3 ubiquitin ligases that poly-ubiquitinate protein substrates marking them for proteolytic degradation via the 26S proteasome. Established SCF complex targets include transcription factors, oncoproteins and tumor suppressors that modulate cell cycle activity and mitotic fidelity. Accordingly, genetic and epigenetic alterations involving SCF complex member genes are expected to adversely impact target regulation and contribute to disease etiology. To gain novel insight into cancer pathogenesis, we determined the prevalence of genetic and epigenetic alterations in six prototypic SCF complex member genes (SKP1, CUL1, RBX1, SKP2, FBXW7 and FBXO5) from patient datasets extracted from The Cancer Genome Atlas (TCGA). Collectively, ~45% of observed SCF complex member mutations are predicted to impact complex structure and/or function in 10 solid tumor types. In addition, the distribution of encoded alterations suggest SCF complex members may exhibit either tumor suppressor or oncogenic mutational profiles in a cancer type dependent manner. Further bioinformatic analyses reveal the potential functional implications of encoded alterations arising from missense mutations by examining predicted deleterious mutations with available crystal structures. The SCF complex also exhibits frequent copy number alterations in a variety of cancer types that generally correspond with mRNA expression levels. Finally, we note that SCF complex member genes are differentially methylated across cancer types, which may effectively phenocopy gene copy number alterations. Collectively, these data show that SCF complex member genes are frequently altered at the genetic and epigenetic levels in many cancer types, which will adversely impact the normal targeting and timely destruction of protein substrates, which may contribute to the development and progression of an extensive array of cancer types.


Introduction
In 2020,~20 million individuals throughout the world were newly diagnosed with cancer, while~10 million succumbed to the disease [1]. Despite these statistics, the molecular determinants (i.e., aberrant genes, proteins and pathways) underlying cancer development and progression remain poorly understood. Accordingly, new insight into the molecular events driving oncogenesis is needed to aid in the development of novel therapeutic strategies aimed at ultimately improving the lives and outcomes of those living with cancer. Decades of biochemical and genetic data have shown that aberrant expression/abundance of key proteins involved in critical biological processes (e.g., cell cycle regulation, DNA damage repair, and apoptosis [2][3][4][5][6][7][8]) are drivers of disease development and progression [9][10][11][12]. In this regard, the ubiquitin-proteasome system, an essential protein degradation system in substrates, which is expected to prevent substrate protein degradation leading to their aberrant accumulation. Beyond mutations, we also discovered that CNAs occur frequently for all six genes and that while CUL1 and SKP2 predominantly exhibit copy number gains, the remaining genes (SKP1, RBX1, FBXW7 and FBXO5) exhibit more copy number losses that correspond with reduced mRNA expression. Collectively, these data suggest that SCF complex members may act as oncogenes and/or tumor suppressor genes in a contextdependent or cancer-specific manner. Finally, we examined the methylation status of all six genes and determined that each is differentially methylated across cancer types, suggesting epigenetic misregulation may also contribute to cancer pathogenesis in a manner like that observed following gene copy number losses. Collectively, these findings provide novel insight into genetic and epigenetic alterations affecting the SCF complex and support the possibility that aberrant expression and/or function of key SCF complex members may be an early etiological event driving the development and progression of many cancer types.

Genes Encoding SCF Complex Members Are Mutated Frequently in Cancer
To determine the mutational frequency and types of alterations occurring in the six prototypic and representative SCF complex member genes (SKP1, CUL1, RBX1, SKP2, FBXW7 and FBXO5) across various cancers, TCGA data [35][36][37] were assessed as detailed in Materials and Methods. Briefly, mutational data, including frameshift deletions/insertions, fusions, in-frame deletions/insertions, missense mutations (benign and damaging), nonsense mutations and splice site/regions mutations were extracted from 10 solid tumor types, and analyzed ( Figure 1A), with the respective mutational frequencies provided in Table S1. As shown in Figure 1A (left), each cancer type harbors mutations in the six SCF complex member genes that typically range from~2% in ovarian cancer to~38% in uterine cancer. Additionally, cursory analyses revealed that the core SCF complex member genes typically harbor fewer than 20 mutations within a given cancer type ( Figure 1A; right), with specific genes, namely SKP1 and RBX1 having fewer than 10 mutations in each cancer type. Conversely, CUL1 exhibits a higher mutational load in some cancers, as evidenced by the 23 and 35 mutations observed in stomach and uterine cancers, respectively. Similarly, the three F-box protein genes typically harbor few mutations, (1-20 total mutations) within most cancer types ( Figure 1A; right); however, FBXW7 is the most frequently mutated gene with 107 and 117 cases in colorectal and uterine cancers, respectively. Collectively, these data reveal that SCF complex member genes are mutated in a diverse array of cancer types with FBXW7 being the most frequently mutated gene in many cases.
Next, to determine whether SCF complex member genes predominantly exhibit specific mutational subtypes (i.e., frameshift deletion/insertion, fusion, in-frame deletion/insertion, missense, nonsense or splice site/regions), TCGA sequencing data [35][36][37] were scrutinized for each of the six genes. The mutational frequencies and the predicted functional impacts of the single nucleotide polymorphisms (SNPs) ("benign", "possibly damaging" and "probably damaging") were determined using Sorting Intolerant From Tolerant (SIFT) and Polymorphism Phenotyping V2 (PolyPhen-2) (detailed in Materials and Methods). Unfortunately, PolyPhen-2 and SIFT score estimates are derived from protein alignments utilizing different protein databases, which sometimes results in inconsistent predictions between approaches. Accordingly, only PolyPhen-2 data were employed for mutation classification analyses, where missense mutations with "not available" (N/A) prediction scores were excluded from all downstream analyses. In general, the overall (Total) distributions of the various mutation categories ( Figure 1B) are predominantly enriched for "probably damaging" mutations (42.4%), followed by "benign" (20.6%) and nonsense (14.8%) mutations, while individual genes are more variable. For example, SKP1 predominantly harbors missense "benign" mutations (47.8%), while other mutations (i.e., missense "probably" damaging; 8.7%) are less common. CUL1 exhibits numerous missense mutations (36.6% "probably damaging"; 32.41% "benign"), while a majority of SKP2 mutations are missense "benign" mutations (47.4%). In contrast, FBXW7 SNPs are primarily (A) Bar graph (left) and heatmap (right) displaying the frequency and prevalence, respectively, of SCF complex member gene mutations in 10 cancer types with the total number of cancer cases indicated within brackets. Alteration categories include frameshifts, fusions, in-frame deletions/insertions, missense, nonsense and splice site mutations. (B) Bar graph presenting the frequency of the alteration categories from 10 cancer types, with the aggregate total frequencies provided in the last column (Total). The total number of alterations identified for each gene are indicated within brackets.
Next, to determine whether SCF complex member genes predominantly exhibit specific mutational subtypes (i.e., frameshift deletion/insertion, fusion, in-frame deletion/insertion, missense, nonsense or splice site/regions), TCGA sequencing data [35][36][37] were scrutinized for each of the six genes. The mutational frequencies and the predicted func- Bar graph (left) and heatmap (right) displaying the frequency and prevalence, respectively, of SCF complex member gene mutations in 10 cancer types with the total number of cancer cases indicated within brackets. Alteration categories include frameshifts, fusions, in-frame deletions/insertions, missense, nonsense and splice site mutations. (B) Bar graph presenting the frequency of the alteration categories from 10 cancer types, with the aggregate total frequencies provided in the last column (Total). The total number of alterations identified for each gene are indicated within brackets.

The Distributions of Encoded SCF Complex Alterations Are Consistent with a Tumor Suppressor Mutational Load
To gain a greater insight into the overall distribution of encoded missense, truncating and splice site alterations within the SCF complex members, the frequency and location (i.e., amino acid position) of encoded alterations were assessed in 10 cancer types ( Figure 2; Table S2). In general, all SCF complex members exhibit alterations that span the length of the encoded protein, an overall distribution that is consistent with a tumor suppressor mutational load [38]. Interestingly, however, several alteration hot-spots were observed in FBXW7, a feature that is more commonly associated with oncogenes [38] (Figure 2). For example, encoded alterations involving R465C/H/G occur in 58 cancer patient samples (bladder, breast, colorectal, head and neck, stomach and uterine). Similarly, the encoded alterations R479Q/G/L/P occur in 22 patient samples (bladder, breast, colorectal, head and neck, pancreatic, stomach and uterine cancers). R505G/C/H/L substitutions account for 39 of the encoded alterations, in colorectal, head and neck, ovarian, stomach and uterine cancers, whereas D520N/Y/H/E occurs in 22 bladder, colorectal, head and neck and ovarian patient cases. Notably, the alteration hotspots detailed above reside within the WD-40 domain of FBXW7, which in some cases may adversely impact protein structure (e.g., D520; discussed further below). Taken together, these data suggest that FBXW7 (and perhaps other SCF complex members) may exhibit tumor suppressor and/or oncogene-like functions in a context/tissue-dependent manner.

Encoded Alterations in SCF Complex Members May Adversely Impact Protein Structure
Having described the frequency and location of encoded alterations in the preceding sections, we next sought to explore the potential impact these changes have on the structure and function by analyzing crystallographic data extracted from The Protein Data Bank (PDB) [31,39]. Mutations deemed "deleterious" or "possibly/probably damaging" by SIFT and Polyphen-2 (Table S2), respectively, are presented in Figures 3 and 4. SIFT deems an alteration "deleterious" based on conserved amino acid positions, where alterations at strongly conserved positions are expected to be intolerant to most substitutions [40]. By contrast, PolyPhen-2 predictions are based on the stability and function of human proteins using functional annotation of SNPs, maps coding SNPs to gene transcripts, extracts protein sequence annotations and structural attributes [41]. Importantly, these specific amino acid substitutions could critically impact SCF complex formation and function by disrupting secondary structure, tertiary structure and protein-protein interactions.     (Table S2) and are predicted to impact protein-protein interactions between SKP1 and CUL1 or SKP1 and SKP2. Amino acids are denoted by their single letter code, numbers indicate amino acid position in the SKP1 protein. Colored arrows identify cancer type in which the underlying mutation was identified (see key). (B) Partial crystal structure (missing amino acids 1-16; ribbon model) of the CUL1 (green)-RBX1 (brown) complex presents predicted detrimental alterations. Indicated alterations are deemed deleterious, possibly/probably damaging by SIFT and PolyPhen-2 and are predicted to impact protein-protein interactions between CUL1-RBX1 or CUL1-SKP1. (C) Partial crystal structure (missing amino acids 1-18; ribbon model) of the CUL1 (green)-RBX1 (brown) complex presents predicted detrimental alterations. Indicated alterations are deemed deleterious, possibly/probably deleterious by SIFT and PolyPhen-2 and are predicted to impact protein-protein interactions between RBX1-CUL1 or RBX1-E2 conjugase.  terations within this region may impact the ability of SKP2 to recognize and ubiquitinate protein targets. Substitutions in the leucine-rich repeat include the elimination of arginine residues (positively charged aliphatic chain) and consist of R164C (lung), R182C (colorectal; skin; uterine) and R234Q (colorectal). Other alterations in this domain include S203F (bladder), P166Q and G239W (uterine). Importantly, amino acid substitutions within SKP2 have the potential to impede normal function which can result in the aberrant accumulation of its substrate proteins.  (Table S2) and are predicted to impact protein-protein interactions between SKP1 and SKP2 or SKP2 and its target proteins. Amino acids are denoted by their single letter code, numbers indicate amino acid position in the SKP2 protein. Colored arrows identify cancer type in which the underlying mutation was identified (see key). (B) Partial crystal structure (missing amino acids 1-262; ribbon model) of the SKP1 (green)-FBXW7 (brown) complex presents predicted damaging alterations. Indicated alterations are deemed deleterious, possibly/probably damaging by SIFT and PolyPhen-2 and are predicted to impact protein-protein interactions between SKP1 and FBXW7 or FBXW7 and its substrate proteins.
FBXW7 is an F-box protein with a WD-40 protein domain that is critical for substrate specificity and whose target proteins includes cell cycle regulators and oncoproteins such as Cyclin E1 and c-MYC [26,46,47]. Unlike SKP2, there are "deleterious" and "possibly/probably damaging" substitutions in all 10 cancer types assessed, with the highest incidences occurring in breast, head and neck and uterine cancers. Substitutions within  (Table S2) and are predicted to impact protein-protein interactions between SKP1 and SKP2 or SKP2 and its target proteins. Amino acids are denoted by their single letter code, numbers indicate amino acid position in the SKP2 protein. Colored arrows identify cancer type in which the underlying mutation was identified (see key). (B) Partial crystal structure (missing amino acids 1-262; ribbon model) of the SKP1 (green)-FBXW7 (brown) complex presents predicted damaging alterations. Indicated alterations are deemed deleterious, possibly/probably damaging by SIFT and PolyPhen-2 and are predicted to impact protein-protein interactions between SKP1 and FBXW7 or FBXW7 and its substrate proteins.
Recall that SKP1 is an invariable member of the SCF complex that recruits the variable F-box proteins and their protein ligands [31]. Although SKP1 alterations occur most frequently in uterine and lung cancers, most substitutions only occur in a single cancer type, except for E133K/Q (bladder, breast; Figure 3A). Notably,~50% of the encoded alterations deemed "deleterious" and "probably/possibly damaging" occur within the F-box protein recognition domain and affect net charge or alpha helicity including E133K/Q (bladder; breast), K128Q (uterine), R126C (colorectal) and A106G (uterine). CUL1 is the scaffolding member of the SCF complex [29] and alterations that potentially affect its ability to bind with either SKP1 or RBX1 could critically impact SCF complex formation and function. As with SKP1, most CUL1 alterations only occur within a given cancer type with the exceptions of E485K (bladder; head and neck; pancreatic; uterine) and Q607H/K (bladder; skin) ( Figure 3B). Strikingly,~50% of alterations involve glutamate residues that are frequently converted to lysine residues, resulting in a change in net charge and include E420K (bladder), E485K (uterine), E493K (head and neck), E733K (colorectal) and E457Q (lung). Only five RBX1 alterations were identified as "deleterious" or "possibly/probably damaging" across all patient samples including S62F (uterine), F81V (bladder), R86H (colorectal), L88F (skin) and T90I (head and neck) ( Figure 3C). These alterations occur within the RBX1 zinc finger binding domain, which may impact interactions between RBX1 and the E2 enzyme or CUL1. Finally, FBXO5 is an F-box protein whose substrates remain largely unknown and has many "deleterious" and "possibly/probably damaging" alterations identified within colorectal and uterine cancers. Approximately half of the substitutions in colorectal cancer convert a leucine residue to a larger residue such as L205F, L220R and L259F. Conversely, there is a wide range of FBXO5 alterations in uterine cancer that includes a serine to glycine conversion within the F-box motif (S278G), which could potentially impact protein-protein interactions with SKP1 and the rest of the SCF complex. Unfortunately, there is no available crystal structure of SKP1-FBXO5 to date, therefore it is difficult to speculate how these alterations may impact either substrate interactions or SCF complex formation and function.
SKP2 is an F-box protein that has been primarily classified as an oncoprotein and targets the products of proto-oncogenes such as Cyclin E1 and c-MYC for proteolytic degradation [7,26,42,43]. SKP2 mutations occur most frequently in bladder, colorectal, skin and uterine cancers and result in key amino acid substitutions, including L105F (uterine) and S135C (bladder; Figure 4A), that are located within the F-box motif, which is critical for interactions between SKP2 and the SCF complex [44]. Another important SKP2 protein domain is the leucine-rich repeat, consisting of a hydrophilic cap and a hydrophobic concave surface that are essential for ligand interaction [45]. Therefore, non-conservative alterations within this region may impact the ability of SKP2 to recognize and ubiquitinate protein targets. Substitutions in the leucine-rich repeat include the elimination of arginine residues (positively charged aliphatic chain) and consist of R164C (lung), R182C (colorectal; skin; uterine) and R234Q (colorectal). Other alterations in this domain include S203F (bladder), P166Q and G239W (uterine). Importantly, amino acid substitutions within SKP2 have the potential to impede normal function which can result in the aberrant accumulation of its substrate proteins.
FBXW7 is an F-box protein with a WD-40 protein domain that is critical for substrate specificity and whose target proteins includes cell cycle regulators and oncoproteins such as Cyclin E1 and c-MYC [26,46,47]. Unlike SKP2, there are "deleterious" and "possibly/probably damaging" substitutions in all 10 cancer types assessed, with the highest incidences occurring in breast, head and neck and uterine cancers. Substitutions within the FBXW7 F-box motif include E287V/K (colorectal; lung; stomach), which induces a change in net charge from negative to positive and L301P (breast), which is predicted to disrupt alpha helicity ( Figure 4B). As with SKP2, there is a trend for arginine substitutions to occur across a diverse array of cancer types (bladder; colorectal; head and neck; stomach; uterine). These positively charged hydrophilic residues often protrude from the protein surface to participate in hydrogen bonding Van der Waals interactions in addition to aiding in salt bridge formation [48,49]. Therefore, arginine substitutions could critically impact the ability of the SCF complex to recognize substrates and regulate the abundance of target proteins. Notably, three of the four arginine substitutions are conversions to glycine residues, all occurring in close spatial proximity of one another (40 residues), at the exterior of the WD-40 ligand binding domain, which effectively eliminates a net positive charge by replacing it with a small hydrophobic residue. Additional substitutions in the WD-40 domain include changes in net charge (E693K; ovarian), gain of a hydrogen bonding group (G411S; pancreatic), or both (D520N/Y/H/E; bladder; colorectal; head and neck; ovarian). Collectively, these encoded amino acid substitutions are predicted to adversely impact the F-box proteins from interacting with the SCF complex and/or their protein substrates, which is expected to promote the aberrant accumulation of substrates and contribute to early disease development in certain instances.

SCF Complex Members Exhibit Frequent Copy Number Alterations in Cancer
While the preceding sections focused on specific mutations and their encoded amino acid substitutions within the six SCF complex members, we sought to investigate the prevalence of gene CNAs, which are expected to impact their overall expression levels. Like the previous sections, TCGA data [35][36][37] from 10 cancer types were scrutinized for CNAs including deep deletions, shallow deletions, gains and amplifications as described within Materials and Methods, with the full list of CNAs provided in Table S3. In general, CNAs occur frequently in all 10 cancer types and collectively range from~19% (pancreatic) to~85% (ovarian) ( Figure 5A). More specifically, SKP1 and RBX1 predominantly exhibit shallow deletions that range from~12% (pancreatic) to~46% (bladder) and~7% (skin) to~79% (ovarian), respectively. In contrast, copy number gains occur most frequently with CUL1 and range from~13% (uterine) to~50% (colorectal), whereas amplifications are rare, ranging from~1.5% (lung) to~7% (ovarian). SKP2 also exhibits amplifications ranging from~0.5% (colorectal) to~8% (lung) and a prevalence of gains ranging from 12% (uterine) to~46% (lung). Conversely, FBXW7 and FBXO5 typically harbor shallow deletions ranging from~15% (pancreatic) to~66% (ovarian) and~7% (uterine) to~59% (ovarian), respectively. Collectively, these data show that CUL1 and SKP2 predominantly exhibit copy number gains (gains and amplifications), whereas SKP1, RBX1, FBXW7 and FBXO5 tend to exhibit more copy number losses (shallow and deep deletions). Importantly, these data lend further support to the possibility that aberrant expression of SCF complex member genes adversely impacts complex function and may contribute to the development and/or progression of many cancer types.
It should also be noted that there is a compounding effect when the individual frequencies of shallow deletions and gains are concurrently assessed for the six genes. That is, the combined frequencies are >50% for losses and gains in breast, colorectal and lung cancers, the three most prevalent cancer types ( Figure 5B). More specifically, the combined frequencies of shallow deletions are 54% (colorectal), 74% (breast) and 84% (lung), while combined frequencies of gains are 55% (breast), 61% (colorectal) and 72% (lung), which suggests that aberrant SCF complex expression (i.e., decreases or increases) may be a significant, yet underappreciated contributor to disease pathology in these cancer types. In support of this possibility, mRNA expression levels are significantly altered (reduced or increased) in a manner consistent with the specific CNA (i.e., loss or gain). For example, reduced mRNA expression is typically associated with copy number losses (deep and shallow deletions) for SKP2, FBXW7 and FBXO5 relative to the diploid state, while increased expression typically occurs with copy number gains (gains and amplifications) ( Figure 6). Consistent with these findings, CNAs involving SKP1, CUL1 and RBX1 also correspond with overall changes in mRNA expression levels. Collectively, these data show that CNAs and corresponding changes in mRNA expression occur frequently for SCF complex member genes, highlighting how CNAs likely adversely affect SCF complex expression and/or function.

SCF Complex Members Are Differentially Methylated in Cancer
To assess the methylation status of SCF complex member genes, β-values and corresponding CNA data were exported from the TCGA Firehose Legacy dataset (https: //gdac.broadinstitute.org/, accessed on 20 October 2021) using cBioPortal [35,36] (Figure 7; Tables S4-S9). Samples were characterized as hypomethylated (β < 0.2), partially methylated (0.2 ≤ β ≤ 0.7), or hypermethylated (β > 0.7) as described within Materials and Methods. Overall, SCF complex members exhibit diverse methylation profiles across the 10 cancer types. As expected, the methylation status of SKP1 varies by cancer type, with SKP1 being hypomethylated (i.e., expressed) in the majority of colorectal, head and neck, ovarian, stomach and uterine cases, whereas it tends to be partially or hypermethylated (i.e., repressed) in bladder, breast, lung and pancreatic cancers. Note that for all cancer types investigated, samples exhibiting SKP1 copy number losses also tend to correspond with hypermethylated states (i.e., greater β-values) relative to diploid samples or those harboring copy number gains (Figure 7, Table S4). Overall, the methylation status of CUL1 is very similar to that of SKP1, as tumor samples with copy number losses also tend to be hypermethylated; however, CUL1 is generally partially methylated or hypermethylated in all uterine cancer cases (Figure 7, Table S5). Interestingly, RBX1 (Figure 7, Table S6) and SKP2 (Figure 7, Table S7) tend to be hypomethylated across all cancer types, irrespective of copy number status, whereas FBXW7 is differentially methylated across cancer types (Figure 7, Table S8). Briefly, FBXW7 is hypomethylated in colorectal, head and neck, ovarian, stomach and uterine patient samples and partially or hypermethylated in bladder, breast, lung, pancreatic and skin cancers. Finally, FBXO5 is partially methylated in bladder and lung cancers, but is predominantly hypomethylated in all remaining cancer types (Figure 7, Table S9). As methylation is a negative regulator of gene expression [50], increases in methylation status may phenotypically mimic copy number losses and/or result in loss of heterozygosity, whereas aberrant decreases in methylation status are expected to correspond with increases in gene expression, which may phenocopy, to a limited extent, copy number gains. Thus, aberrant increases or decreases in methylation status may adversely impact SCF complex members harboring tumor suppressor [9,10,51,52] or oncogene-like functions [53,54], respectively. Collectively, these data show that SCF complex member genes are differentially methylated across a variety of cancer types and supports the possibility that changes in methylation status may induce aberrant SCF complex function and contribute to cancer pathogenesis.  Figure 6). Consistent with these findings, CNAs involving SKP1, CUL1 and RBX1 also correspond with overall changes in mRNA expression levels. Collectively, these data show that CNAs and corresponding changes in mRNA expression occur frequently for SCF complex member genes, highlighting how CNAs likely adversely affect SCF complex expression and/or function.

SCF Complex Members are Differentially Methylated in Cancer
To assess the methylation status of SCF complex member genes, β-values and corresponding CNA data were exported from the TCGA Firehose Legacy dataset (https://gdac.broadinstitute.org/, accessed on 19 December 2021) using cBioPortal [35,36] (Figure 7; Tables S4-S9). Samples were characterized as hypomethylated (β < 0.2), partially methylated (0.2 ≤ β ≤ 0.7), or hypermethylated (β > 0.7) as described within Materials and Methods. Overall, SCF complex members exhibit diverse methylation profiles across the 10 cancer types. As expected, the methylation status of SKP1 varies by cancer type, with SKP1 being hypomethylated (i.e., expressed) in the majority of colorectal, head and neck, ovarian, stomach and uterine cases, whereas it tends to be partially or hypermethylated (i.e., repressed) in bladder, breast, lung and pancreatic cancers. Note that for all cancer types investigated, samples exhibiting SKP1 copy number losses also tend to correspond with hypermethylated states (i.e., greater β-values) relative to diploid samples or those harboring copy number gains (Figure 7, Table S4). Overall, the methylation status of CUL1 is very similar to that of SKP1, as tumor samples with copy number losses also tend to be ). SKP1, CUL1, RBX1, SKP2, FBXW7, and FBXO5 CNAs (deep deletions; shallow deletions; gains, amplifications) and diploid cases are presented along the x-axis with total case numbers indicated within brackets. Note that categories with ≤ 2 cases are identified by dots and that in general, deep deletions, and amplifications are rare. Note that some of the y-axes are differentially scaled to better present the data.

Discussion
In this study, we examined the genetic and epigenetic changes associated with six prototypic SCF complex member genes across 10 common cancer types. First, we determined that each member is somatically mutated across multiple cancer types, with FBXW7 exhibiting the most frequent number of mutations. Additionally, we determined that the mutations are typically distributed across the entirety of their respective coding regions, which is consistent with the mutational distribution of a tumor suppressor gene [38]; however, FBXW7 also harbored focal hot-spots that are consistent with the mutational profile of an oncogene [38]. Interestingly, in silico analyses determined that many of the encoded amino acid substitutions are predicted to adversely impact protein structure and/or function of the SCF complex, supporting a potential role in cancer pathogenesis. Next, CNA analyses revealed that each gene is frequently altered in 10 common cancer types, and that SKP1, RBX1, FBXW7 and FBXO5 tend to exhibit more losses, while CUL1 and SKP2 exhibit more gains. It is also important to note that the specific categories of CNAs corresponded with similar changes in gene expression at the mRNA levels. That is, cancers with deep or shallow deletions exhibited reduced mRNA expression levels relative to diploid cases, while those harboring gains or amplifications exhibited increases. Finally, examination of their DNA methylation profiles revealed a large range of patterns for each gene. In general, samples harboring copy number losses (particularly shallow deletions) were frequently hypermethylated relative to the diploid cases or those with copy number gains (gains or amplifications). In summary, these findings determined that the six SCF complex member genes exhibit frequent mutations, CNAs and/or aberrant methylation profiles that collectively are predicted to adversely impact complex member expression and/or function that is consistent with an etiological role in cancer development and progression.
When assessing the amino acid substitutions in conjunction with available crystal structure data, our results highlight the potential functional impact that encoded alterations may have on the SCF complex and identifies a possible mechanism by which aberrant SCF complex structure and function may contribute to cancer pathogenesis. Importantly, additional functional studies are highly warranted as not every mutation (and encoded alteration) will have pathogenic implications in cancer; however, it is well established that specific amino acid substitutions can adversely impact protein structure and function, and thus have the potential to contribute to disease pathogenesis. For example, the mutations we identified that alter key amino acid characteristics, including net charge, hydrogen bonding ability and steric interactions (i.e., protein-protein interactions) within the leucinerich repeat domain of SKP2 or the WD-40 domain of FBXW7 can decrease the affinity and specificity of these binding sites for their protein substrates, including the oncoproteins Cyclin E1 and c-MYC [26,47,55]. SKP2 also targets P27 (CDKN1B) for proteolytic degradation, a cell cycle regulating protein that inhibits Cyclin Dependent Kinase activity [56]. Importantly, aberrant P27 accumulation is associated with chromosome instability (CIN; ongoing changes in chromosome numbers) and mitotic defects [2,53,57], whereas aberrant Cyclin E1 and c-MYC turnover lead to cell cycle and apoptotic defects that promote cancer development and progression [9,10,12,[58][59][60][61]. In fact, increased c-Myc expression in mouse embryonic fibroblasts increases growth rate, enhances clonogenic growth, decreases in contact inhibition and spontaneously forms tumors in mice within 10 days [62], providing strong evidence that deregulated c-MYC expression exhibits a significant role in tumorigenesis. Importantly, while the c-MYC protein is overexpressed in ∼70% of human cancers, typically only~20% of tumors harbor c-MYC gene amplifications or translocations and thus it remains possible that aberrant c-MYC turnover/degradation may account for this discrepancy [63]. Thus, these findings highlight the importance of c-MYC regulation at the protein level and the possible malignant consequences of abnormal SCF complex function. Moreover, the detrimental impact encoded alterations have on protein structure and function is perhaps best exemplified by Wang and colleagues [55], who determined that D331 within the leucine-rich repeat domain of SKP2 is essential for its interaction with CKS1 (Cyclin-dependent Kinases regulatory protein 1) and the subsequent ubiquitination of P27. More specifically, they noted that a D331A substitution converted the normally negative surface potential to a positive potential, which ablated the SKP2-CKS1 interaction, while a conserved substitution, D331E, maintained the interaction. Similarly, amino acid substitutions involving R417 and R457 within the WD40 domain of FBXW7 abolish Cyclin E binding, while R495 substitutions reduce binding affinity [47,64]. While none of these alterations were detected within the current study, their results highlight the need and value of downstream genetic and functional studies aimed at assessing the plethora of individual substitutions to accurately determine their impacts on protein-protein interactions and overall SCF complex function. This is particularly important as the SCF complex regulates numerous biological processes required for genome stability, including cell cycle progression, centrosome biology and DNA repair [9,10,12,19,20,65]. Thus, it is conceivable that specific amino acid substitutions occurring in key protein motifs may adversely impact SCF complex function and underlie the accumulation of protein substrates, especially (proto-) oncogenes whose increase in abundance promotes genome instability and contributes to cancer pathogenesis [9,10,12,19,20]. For example, CCNE1 (Cyclin E1) is an established oncogene [58,66,67] that is genomically amplified in many cancer types [35][36][37] and whose overexpression induces CIN, which is associated with cell cycle misregulation, genome instability, cellular transformation and tumor formation in mice [59]. More specifically, Karst and colleagues [67] determined that increased abundance of Cyclin E1 leads to inappropriate cell growth including accelerated growth, loss of contact-inhibition, clonogenic growth and anchorage independent growth. Moreover, we recently demonstrated that reduced expression of SKP1, CUL1 and RBX1 prevents Cyclin E1 degradation leading to an increase in abundance that induces CIN and promotes cellular transformation in colorectal and ovarian cancer contexts [9,10,12]. As the three core SCF complex members interact in an epistatic manner, it is reasonable to assume that specific alterations to any of these proteins will adversely impact complex formation and lead to abnormal increases in Cyclin E1 abundance. Additionally, it was also shown through the use of genetic rescue experiments that the aberrant increases in Cyclin E1 observed following FBXW7 inactivation in colorectal cancer cells were an underlying mechanism driving increases in CIN [68]. Importantly, SKP2 and FBXW7 regulate Cyclin E1 turnover [2,18,25,46] and thus specific SKP2 and FBXW7 mutations, CNAs or altered methylation (i.e., hypermethylation) status are predicted to adversely impact Cyclin E1 abundance and contribute to disease pathogenesis. Thus, it is the collective lack of Cyclin E1 turnover arising from defects in the expression and/or function of SCF complex members that is expected to phenocopy genomic amplification of CCNE1 and contribute to cancer pathogenesis. Given estimates that the SCF complex targets hundreds to thousands of proteins, it is possible that the aberrant regulation (i.e., turnover) of additional protein substrates, such as P27, RAD51, c-MYC may also contribute to cancer pathogenesis [3][4][5]7,43,69].
While TCGA data reveal that CNAs of the six SCF complex member genes are common in cancer and are either predominantly gained or lost, it is important to highlight the phenotypic consequences that these CNAs may have in cancer irrespective of their traditionally established oncogenic or tumor suppressive roles. Previous genetic studies have shown that heterozygous knockout of SKP1, CUL1, RBX1 and FBXO7 induces CIN [9][10][11], an enabling hallmark of cancer [70] frequently associated with cellular transformation, drug resistance, metastasis and poor patient prognosis in many cancer types [71][72][73][74][75]. Indeed, CIN is a dynamic form of genome instability that drives ongoing changes in genetic and cell-to-cell heterogeneity that is proposed to contribute to cancer development by increasing the rate at which oncogenes, tumor suppressor genes and cell cycle regulators are gained, lost, or altered [76][77][78]. Importantly, the phenotypic consequences corresponding with cancer development (e.g., CIN) following aberrant expression of SCF complex members may vary and are likely to arise via the misregulation of protein substrates in a contextdependent manner. For example, overexpression of SCF complex members may result in excess degradation of protein substrates with tumor suppressor functions, and has been associated with cancer cell stemness, tumor progression and worse patient survival [79][80][81][82][83][84][85][86].
Similarly, amplification and overexpression of SKP2 promotes the increased degradation of P27 and disease progression [56] and is associated with worse survival outcomes and poor response to therapy in numerous cancer types [87][88][89], contributing to SKP2 being traditionally classified as an oncogene. Conversely, other studies have shown that reduced SKP2 expression leads to aberrant increases in P27 abundance that is suggested to prevent mitotic entry and result in increased nuclear areas [2], a CIN-associated phenotype [70,[90][91][92] suggestive of large increases in DNA/chromosome content (i.e., polyploidy) [69]. In contrast, FBXW7 has been primarily classified as a tumor suppressor gene [51,52] and loss of SCF FBXW7 activity corresponds with aneuploidy, micronucleus formation (extranuclear bodies found outside the primary nucleus and a hallmark of CIN), sister chromatid cohesion defects and chromosome segregation defects (i.e., CIN) [68,93]. However, our data support the possibility that FBXW7 may also exhibit oncogene-like roles as gains, amplifications and mutational hot-spots were observed [38]. Indeed, Galindo-Moreno and colleagues [94] determined that FBXW7 amplification in tumors harboring wild-type TP53 expression reduced breast cancer patient survival but did not impact survival of patients with skin or bladder cancers. Accordingly, these collective data highlight that losses or gains of FBXW7 (and perhaps other SCF complex member genes), may adversely contribute to cancer pathogenesis in a context-dependent manner [95]. With respect to FBXO5, Marzio et al. [3] recently determined that reduced FBXO5 expression corresponds with a DNA re-replication phenotype (i.e., endoreduplication) in a breast cancer model and that loss of its F-box dependent function leads to increases in RAD51 abundance (a homologous recombination repair protein). In contrast, Vaidyanathan and co-workers [96] determined that FBXO5 overexpression corresponds with increases in mitotic defects (e.g., lagging and incorrect segregation of chromosomes) and aneuploidy that promotes CIN in transgenic mouse models. Additionally, it is important to note that while each F-box protein targets a distinct array of protein substrates, there is some functional redundancy between F-box proteins, in that multiple F-box proteins can share specific substrates. For example, while SKP2 and FBXW7 exhibit distinct substrate profiles, both target Cyclin E1 and c-MYC for proteolytic degradation [2,7,26,43]. Thus, this inherent functional redundancy may serve as a compensatory mechanism to limit the functional outcomes associated with the aberrant expression and/or function of a given F-box protein. Therefore, with the knowledge that 69 distinct SCF complexes are predicted to target hundreds to thousands of protein substrates, aberrant expression of the core SCF complex members are expected to evoke greater phenotypic consequences than alterations involving the individual F-box proteins. In this regard, future functional studies are required to determine the impacts aberrant expression and function of each individual SCF complex member have on protein substrates, genome instability and disease pathogenesis are now highly warranted.
In summary, this study revealed that the six SCF complex member genes predominantly exhibit missense damaging mutations, a subset of which likely impact SCF complex function by disrupting protein-protein interactions between SCF complex members and their protein substrates. Further, the distribution of the encoded amino acid substitutions and CNA data support the possibility that SCF complex members may exhibit both tumor suppressive or oncogenic roles that will likely be disease and context dependent. Finally, we determined that SCF complex members tend to be differentially methylated across multiple cancer types, which may phenotypically mimic certain CNAs (e.g., shallow deletions) and contribute to cancer pathogenesis but in an epigenetic manner. Ultimately, these alterations may underlie the aberrant accumulation of protein substrates that are implicated in CIN [2,9,10,12,59] and cancer pathogenesis [3,7,26,67,97,98]. Unfortunately however, the complete spectrum of target specificities of all 69 F-box proteins and their implications for disease development remain largely unknown or poorly characterized. Accordingly, functional studies are now required to determine the specific outcomes driven by aberrant expression and/or function of each SCF complex member gene and its impact on substrate accumulation to better understand the molecular determinants giving rise to cancer development, which will be critical to develop innovative precision medicine strategies to better combat the disease. For instance, as aberrant SCF complex function is associated with CIN [9,10,12], developing biomarkers to detect abnormal SCF complex expression could aid early diagnosis in cancers that exhibit high levels of CIN, including colorectal [99] and ovarian cancers [100,101], where early disease detection will be critical to ultimately improve patient outcomes [102,103]. Thus, future studies need to be designed that compile and analyze patient sample datasets for mutations, CNAs and methylation changes with matched normal samples and patient outcomes to gain a comprehensive understanding of how genetic alterations are associated with disease and their impact on patient outcomes. In addition, studies aimed at investigating the pathogenic implications of SCF complex alterations are needed to gain a greater understanding of the etiological events contributing to oncogenesis.

Assessing the Frequency, Distribution and Predicted Functional Impact of Encoded SCF Complex Mutations
SCF complex member SNPs were evaluated to assess the frequency and the predicted functional impact of encoded alterations. Mutational frequencies were calculated as follows: total mutations within a given mutation class (n)/total number of mutations (N) for each gene × 100%. SNPs were assessed using SIFT (https://sift.bii.a-star.edu.sg/, accessed on 9 November 2021) [104] and PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/) [105]; Note that only single amino acid substitutions can be assessed using PolyPhen-2 and SIFT, where the remaining alterations (e.g., fusions, splice sites) are denoted with not applicable (N/A). For reference purposes, SIFT classifies SNPs as tolerated or deleterious [104], whereas PolyPhen-2 classifies SNPs as benign, possibly damaging or probably damaging [105]. Conceptually, SIFT assesses the conservation of an amino acid position (i.e., sequence identity/similarity) across species, where substitutions occurring at highly conserved positions that are deemed "deleterious" [40]. By contrast, PolyPhen-2 predicts the impact of substitutions on the stability and function of proteins using functional annotation of SNPs, maps coding SNPs to gene transcripts, extracts protein sequence annotations and structural attributes to create a score ranging from 0.0 (benign) to 1.0 (damaging) to ultimately assign it as "benign", "possibly damaging" or "probably damaging". A prediction of "possibly damaging" means that the substitution is predicted to be damaging, but with low confidence [41]. To display the spatial distribution and category of the encoded alterations (e.g., missense, truncating, splice or fusion), lollipop diagrams were extracted from cBioPortal [35][36][37] and alterations were classified as: (1) missense (unknown significance); (2) missense (putative driver); (3) truncating (unknown significance); (4) truncating (putative driver); (5) splice (unknown significance); or (6) or splice (putative diver). Figures presenting mutational data were generated with Prism v9 (GraphPad, San Diego, CA, USA) and assembled in Photoshop 2022 (Adobe, Toronto, ON, Canada).

Gene Copy Number Alteration and mRNA Expression Analyses
Gene CNA data were extracted from TCGA Pan-Cancer Atlas [37] with CNAs identified using the following Onco Query Language commands in cBioPortal [35,36]: (1) HOMDEL, deep deletion (i.e., loss of 2 alleles); (2) HETLOSS, shallow deletion (i.e., loss of 1 allele); (3) GAIN, small gain (i.e., gain of 1 allele); and (4) AMP, large amplification (i.e., gain of ≥ 2 alleles). OncoPrint and mRNA expression data were retrieved using cBioPortal (no later than 10 November 2021) to identify the individual and cumulative frequencies of CNAs (shallow deletions and gains) which were compared with their respective mRNA expression z-scores (z = expression in tumor sample-mean expression in reference sample[diploid])/standard deviation of expression in reference sample[diploid]) for breast, colorectal and lung cancers. Figures presenting CNA data were generated in Prism and assembled in Photoshop.

Assessing the Methylation Status of SCF Complex Members
The methylation status of each gene was assessed using beta (β) values and corresponding CNA data and extracted from TCGA Firehose Legacy (https://gdac.broadinstitute. org/) for each cancer type. Briefly, β-values were calculated as follows: β = M/M + U, where M > 0 and U > 0 represent the methylated and unmethylated signal intensities measured by the Illumina 27k (ovarian serous cystadenocarcinoma) or 450k BeadChip arrays (all other cancer types), respectively [108]. For each gene, samples were characterized as being hypomethylated (β < 0.2), partially methylated (0.2 ≤ β ≤ 0.7), or hypermethylated (β > 0.7). All data were accessed no later than 10 November 2021.  Institutional Review Board Statement: Ethical review and approval were waived for this study, due to the use of publicly available datasets from TCGA Research Network (https://www.cancer.gov/ tcga) [37] and Oncomine (https://www.oncomine.org) [35,36], accessed on 18 November 2021. No other human specimens or clinical data were utilized.
Informed Consent Statement: All patient data presented in Figures 1-7 are from TCGA Pan-Cancer Atlas and Firehose Legacy datasets made publicly available through cBioPortal (https: //cbioportal.org) and Broad Institute (https://gdac.broadinstitute.org/), accessed on 18 November 2021, respectively, and for which informed consent was previously obtained by TCGA.